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Abstract 


J  An  equation  that  does  not  require  tables  is  given  to  determine  a  one-sided  tolerance  limit  for 
the  100  pi!l  percentile  of  a  normal  distribution  with  confidence  1-y  for  any  sample  size  n. 
This  equation  gives  accuracy  to  approximately  three  or  more  significant  digits  when 
compared  to  tabled  values.  Thus  it  is  possible  to  develop  an  automated  procedure  for 
determining  tolerance  limits  that  is  not  restricted  to  tabled  values.  ^ _ 
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In  quality  control  applications,  the  practitioner  may  wish  to  estimate  an  extreme  (lower) 
percentile  of  the  distribution  from  a  sample.  For  example,  daily  samples  are  made  from  a 
machine's  production.  If  the  estimated  100  plh  percentile  of  some  characteristic  is  too  low. 
the  machine  is  adjusted.  Since  overestimating  the  100  p,h  percentile  defers  needed  machine 
adjustment,  it  is  often  desirable  to  be  conservative  and  use  a  lower  tolerance  limit,  to.  such 
that  the  true  population  100  plh  percentile  is  above  to  with  probability  or  confidence  1-y: 
P[P(X  §  to)  g  1  -  p)  g  l  -  y.  If  the  data  come  from  a  normal  distribution,  to  has  the  form 
x  -  Ks  where  x  is  the  sample  mean,  and  s  is  the  sample  standard  deviation.  Tables  of  K  for 
various  values  of  p,  y,  and  n  have  been  developed  (e.g.  Guttman  1970).  However,  when 
using  daily  samples,  sample  sizes  may  vary  and  it  may  be  desirable  to  vary  the  percentile 
being  estimated  or  the  desired  level  of  confidence.  If  the  process  of  determining  tolerance 
limits  is  to  be  automated,  a  method  of  determining  K  needs  to  be  found  that  does  not 
require  the  use  of  tables  and  is  not  restricted  in  sample  size  and  confidence  levels  to  tabled 
values.  Lieberman  (1958)  gives  a  formula  for  K  that  could  be  used  for  sample  sizes  larger 
than  50  (the  extent  of  his  table).  This  formula  tends  to  underestimate  K,  which  leads  to  an 
overestimation  of  the  tolerance  limit,  Iq,  and  is  contrary  to  our  goal.  This  underestimation  of 
K  becomes  more  extreme  as  the  same  size  decreases,  especially  if  the  formula  is  used  for 
sample  sizes  less  than  50.  Lieberman’s  equation  has  been  used  in  other  popular  references 
(e  g.  Natrella  1966).  The  purpose  of  this  paper  is  to  show  how  this  formula  can  be 
improved  to  give  reasonably  accurate  values  of  K  for  any  sample  size  n,  percentile  100  p, 
and  confidence  1-y. 

Given  a  sample  size  n  from  a  normal  population  N(|x,  cr2),  in  theorem  4.4  Guttman  (1970) 
shows  that 


P[P(X  e  to)  S  I  -  p]  =  P[T„.,*(Vn  Zp)  SVnK 


where  t<,  =  x  —  Ks,  zp  is  the  (1  -  p)  100th  percentile  of  the  standard  normal  distribution  and 
Tv*(8)  is  the  noncentral  Student's  t  distribution  with  v  degrees  of  freedom  and  noncentrality 
parameter  8.  Abramowitz  and  Stegun  (1972.  equation  26.7.10)  show  that  the  noncentral  t 
distribution  may  be  approximated  by  the  standard  normal  distribution  Z. 
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Therefore  K  should  be  found  such  that 
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since  I  -  y  =  P(Z  §  zT). 


Solving  for  K  yields  the  following  equation: 


zp(l  -  f)  +  jzp2  (1  -  f)2  -  1(1  -  f)2  -  z72/(2(n  -  D))(zp2  ~  z^/n)}1 
(I  -  f)2  -  z^2/<2(n  -  1|) 


where  f  =  l/(4(n  -  1)).  Lieberman’s  formula  is  based  on  equation  (2),  but  ignores  the  factor 
f.  As  n  gets  larger  this  factor  is  negligible;  but  for  small  n,  leaving  out  this  factor 
underestimates  K. 

In  order  to  determine  K  without  the  use  of  tables,  zp  and  zy  must  be  determined.  An 
approximation  for  these  quantities  may  be  found  in  Abramowitz  and  Stegun  (1972,  equation 
26.2.23): 

zp  =  t  -  (c0  +  C|t  +  c2t2)/(  1  +  d,t  +  d2t2  +  d3t3)  (3) 

where 

t  =  (in(l/p2))1/2, 
c0=  2.515517. 

c,  =  0.802853, 
c2  =  0.010328. 

d,  =  1.432788, 
d2  =  0.189269, 

and  d3  =  0.001308. 

The  error  of  the  approximation  of  zp  or  zy  is  less  than  4.5  x  104  in  absolute  value. 


A  FORTRAN  program  was  written  in  double  precision  to  compare  equation  (2)  with 
Lieberman’s  formula  and  tabled  values  (see  Guttman  1970,  table  4.6).  When  computing  K 
in  Lieberman's  formula  and  equation  (2),  approximations  for  zp  and  zy  from  equation  (3) 
were  used.  The  chosen  values  of  the  parameters  were:  p  =  .25,  .10.  .05,  .01,  .001;  y  =  .25, 
.10,  .05,  .01;  and  n  =  10  to  200  by  10.  Some  typical  results  can  be  seen  in  tables  1  and  2. 
Lieberman’s  formula  will  always  give  a  value  for  K  that  is  less  than  equation  (2)  because 
factor  f  is  missing.  The  difference  in  equation  (2)  and  Guttman’s  tabled  values  will  be  due 
to  the  error  in  the  two  approximations  used:  the  approximation  of  the  noncentral  t  by  the 
normal  (equation  (I))  and  the  approximation  of  the  normal  quantiles  (equation  (3)).  Of 
these,  the  error  in  equation  (I)  has  the  most  effect  especially  for  small  sample  sizes.  Of 
course  a  more  accurate  determination  of  K  could  be  made  using  tabled  values  of  zp  and  zr 
This  changes  the  estimates  of  equation  (2)  by  .001  or  less  in  absolute  value.  Because  of  the 
goal  of  tolerance  limits,  a  conservative  approach  would  be  to  overestimate  K.  For  this 
reason  equation  (2)  is  preferable  to  Lieberman’s  formula  as  it  either  overestimates  K  (due  to 
the  approximation  of  the  noncentral  t  distribution  by  the  normal)  or  the  underestimation  is 
less  severe.  As  n  gets  large,  either  formula  will  give  reasonable  estimates  of  K  since  the 
factor  f  =  l/(4(n  -  I ))  goes  to  zero.  If  greater  accuracy  is  desired,  and  the  1MSL  library  of 
subroutines  is  available,  the  noncentral  t  cumulative  distribution  function  routine  MDTN  can 
be  used  with  the  root  finding  routine  ZRLAL2.  The  estimate  of  K  from  equation  (2)  can  be 
used  as  an  initial  estimate.  This  procedure  gives  results  that  agree  with  Gunman's  tables. 

Thus  tolerance  limits.  t„  =  x  -  Ks.  may  be  estimated  for  samples  from  normal  populations 
without  the  use  of  tables.  Using  equations  (2)  and  (3),  K  may  be  determined  with 
approximately  three  or  more  significant  digits  of  accuracy.  When  the  estimation  of  K  has 
fewer  than  three  significant  digits  of  accuracy,  the  problem  can  be  traced  to  the  poor 
approximation  of  a  noncentral  t  distribution  for  small  degrees  of  freedom  by  the  normal 
distribution.  II  greater  accuracy  is  desired,  tables  or  IMSL  subroutines  are  needed. 
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Table  I. — Values  of  K  for  one-sided  tolerance  limits,  p  =  .05.  7  =  .25 


n 

Lieberman 

Equation  (2) 

Guttman 

to 

2  0322 

2.0995 

2.104 

20 

1  9021 

t  .93(8) 

1 .932 

30 

1.8501 

1.8674 

1.869 

40 

1  8203 

1 .8329 

1.834 

50 

1 .8(8)5 

1.8103 

1.811 

60 

1.7862 

1.7941 

1  795 

70 

1.7751 

1.7819 

1.782 

80 

1.7663 

1.7721 

1.772 

90 

1 .7590 

1  7642 

1  764 

UK) 

1.7529 

1.7575 

1.758 

lit) 

1.7477 

1.7519 

1.752 

120 

1.743 1 

1.7469 

1.747 

1.30 

1  7392 

1.7426 

1 .743 

140 

1.7356 

1.7388 

1.739 

150 

1.7324 

1.7354 

1 .735 

160 

1.7296 

1.7324 

1.732 

170 

1.7270 

1.7296 

1.730 

180 

1.7246 

1.7271 

1.727 

190 

1.7224 

1 .7248 

1 .725 

2<K) 

1 .7204 

1.7226 

1.723 

Cable  2. — Values  of  K  for  one-sided  tolerance  limits  Tor  selected  values  of  p,  y,  and  n 


P 

y 

n 

Lieberman 

Equation  (2) 

Guttman 

It) 

.25 

10 

1.6154 

1 .6683 

1.671 

10 

.25 

50 

1.4174 

1  4250 

1 .425 

10 

25 

1(8) 

1.3760 

1.3796 

1.380 

10 

.25 

218) 

1  3477 

1.3494 

1.349 

It) 

(15 

10 

2.3215 

2.4231 

2.355 

10 

05 

50 

1.6401 

1.6497 

1  646 

1(1 

1)5 

1(8) 

1  5243 

1.5285 

1.527 

10 

05 

2(8) 

1  4485 

1 .4504 

1  450 

(15 

.25 

10 

20322 

2.0995 

2,104 

05 

25 

50 

1  8(8)5 

1.810.3 

1811 

05 

25 

|(X) 

1  7529 

1.7575 

1.758 

(IS 

25 

2(8) 

1  7204 

1  7226 

1  723 

os 

(15 

10 

2.8758 

3  (8)47 

2  91  1 

05 

05 

50 

2.0590 

2.0713 

2.065 

05 

05 

too 

1  9239 

1  929.3 

1  927 

05 

05 

2oo 

1.8362 

1  8386 

1  8.37 

01 

:s 

10 

2  8235 

2  9182 

2  927 

in 

25 

so 

2  5255 

2.5371 

2.538 

01 

25 

J00 

2  4627 

2  4692 

2  470 

01 

25 

200 

2  4215 

2  4246 

2.425 

III 

os 

III 

3  94 1 2 

4  1224 

3  981 

III 

05 

so 

2  K555 

2  8725 

2.862 

01 

us 

100 

2  OKOX 

2  688.3 

2  684 

III 

05 

2oo 

2  5684 

2  5719 

2.570 
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